IP ROUTING LOOKUP SCHEME AND SYSTEM 
FOR MULTI-GIGABIT SWITCHING ROUTERS 

BACKGROUND OF THE INVENTION 

A. Field of the Invention 

The present invention relates to an IP (Internet Protocol) routing lookup 
scheme and system for multi-gigabit switching routers, especially to an IP routing 
lookup scheme and system which can guarantee 3 memory accesses in the worst 
case using a memory size less than 512KB. Moreover, we can improve the 
performance of multi-gigabit switching routers greatly by pipeline skill in 
hardware. 



B. Description of the Prior Art 

15 The architecture of a multi-gigabit IP switching router is schematically shown 

in Fig. 1. It mainly includes a number of link interfaces 11, a CPU module 12, and 
a forwarding engine 13 interconnected with a switching fabric 14. The forwarding 
engine 13 employs a forwarding database which is a local version of the routing 
table downloaded from the CPU module 12 to make the routing decision. Although 

20 the routing updates may occur frequently, it is not necessary to dovmload a new 
forwarding database for each routing update. 

The CPU module 12 executes the routing protocols, such as RIP and OSPF, and 
needs a dynamic routing table for fast updates and fast generation of forwarding 
25 databases. For this reason, the forwarding database shall be optimized to fiimish 
fast lookups. 

The architecture of a forwarding engine 1 3 with superscalar scalar and pipeline 
design is shown in Fig. 2. An incoming IP packet will be buffered in the external 
30 data bus 21. The external data bus 21 is coupled to an internal data bus 22 for 
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processing the incoming IP packet in the forwarding module 20, The forwarding 
module 20 includes a route lookup module 23, a header verification module 24, a 
header update module 26 and a MAC address substitution module 25. The route 
lookup module 23 reads the destination IP address of the incoming packet from the 
5 internal data bus 22. The IP header of the incoming packet is forwarded to the 
header verification module 24 and the header update module 26 to process at the 
same time. If the IP header of the incoming packet is not correct, the packet will 
be dropped. The lookup is then terminated. If the IP header is correct, the header 
verification module 24 will send a signal to the header update module 26 to update 

10 the decreased value of TTL and the recalculated checksum of the incoming IP 
packet via the external data bus 21. The route lookup module 23 will then provide 
the next hop (port number) for the incoming IP packet. The MAC address 
substitution module 25 then substitutes the source MAC address and the destination 
MAC address of the incoming IP packet by the MAC address of the output interface 

15 11 and the immediate next hop (a router or the destination host) respectively. Then, 
the incoming IP packet can be forwarded into the output interface 11 via the 
external data bus 2 1 . 

According to the architecture as shown in Fig. 2, the bottleneck is in the route 
20 lookup module 23 because the header verification module 24, the header update 
module 26, and MAC address substitution module 25 all communicate with the 
route lookup module 23 to get the information for the forwarding next hop. The 
design of the rout lookup module 23 will significantly affect the packet forwarding 
rates and also the traffic on the networks. 

25 

Recent approaches on improving the packet forwarding rates shows that the IP 
lookup scheme is a tradeoff issue between memory size and access times. For 
instance, the most straightforward lookup scheme is to have a forwarding database 
containing every next hop for each 32-bit IP address. In this case, it requires only 
30 one memory access for IP address lookup. However, the Next Hop Array for an IP 
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address of 32-bits directly spread for exact matching will need 4 GB (2" - 4 GB). 

In another case, an indirect lookup approach is employed to reduce the size of 
forwarding database. As illustrated in Fig. 3, each IP address is partitioned into 

5 two parts: segment (16-bit) 3 1 and offset (16-bit) 34. The Segmentation Table (32- 
bit) 32 based on the segment 31 has the size of 64K for storing 65536 (2'*) entries. 
If the value of the entry of the Segmentation Table 32 is smaller than 256 (2*), then 
it records the next hop of the routing. If the value of the entry of the Segmentation 
Table 32 is larger than 255, then it stores a pointer pointing to the associated Next 

10 Hop Array 33 (hereinafter referred to as NHA). Each NHA 33 has the size of 64K 
(2'^). Each entry in the NHA 33 is 8-bit for recording the next hop (port number) 
of the destination IP address. Thus, for a destination IP address a.b.x.y, the 
segment a.b can be an index for looking up the Segmentation Table 32 while the 
offset x.y an index for looking up the associated NHA, if necessary. In other words, 

15 for a segment a.b, if the length of the longest prefix for this segment is less than or 
equal to 16 (the segment length), then the corresponding next hop can be found in 
the Segmentation Table 32 without having to access the associated 64KB NHA. 
On the other hand, if the length of the longest prefix is greater than 16, then a 
pointer in the Segmentation Table 32 can be found to point to an associated 64KB 

20 NHA 33. According to this routing lookup mechanism, the maximum number of 
memory accesses for an IP address lookup is two with reduced memory size. 
Nevertheless, its memory requirement is still too big. In the worst case, it requires 
memory space of 4 x 64 KB + 64K x 64KB for the segmentation table plus the 
NHA. 

25 

Some approach provides a software-based solution which can compress a table 
of 40,000 entries into 150-160 Kbytes. However, when this software is 
implemented in hardware, the memory accesses for a lookup is 2 in the best case and 
9 in the worst case. Another approach provides a large size DRAM for fast routing 
30 lookup. The maximum number of memory accesses for a lookup can be reduced to 
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2 but with a forwarding table of 33Mbytes. If an intermediate length table is added, 
the forwarding table can be reduced to 9 Mbytes, but the maximvim number of 
memory accesses for a lookup will be increased to 3. 

5 Another approach provides a lookup scheme based on the binary search 

mechanism. It requires a worst case time of /ogjCaddress bits) hash lookups. Thus, 5 
hash lookups are needed for IPv4 and 7 for IPv6 (128-bit). This software based 
binary search work is further improved by employing the cache structure and using 
the multiway and multicolumn search. For a database of N prefixes with address 

10 length W, the native binary search scheme takes 0(W*logN) searches. This 
improved schemes takes only 0(W+logN) searches. However, these software-based 
binary search schemes are not easy to be implemented in hardware. 



Accordingly, it is a primary object of the present invention to provide a fast 
lookup scheme and system which requires no more than three memory accesses for 
looking up the forwarding next hop and can be implemented in a forwarding table 
20 less than 5 1 2KB memory. 

It is another object of the present invention to provide a fast lookup scheme 
and system which can perform longest prefix matching and require only a 512KB 
memory implemented in a pipelined skill. 



Briefly described, the steps of the inventive method comprises: 

(a) partitioning each subnet IP address of a routing information into a 
segment and an offset with variable length; 

(b) building a segmentation table based on the segment, and the 
30 segmentation table comprises two fields: a pointer/next hop, and an offset length, 
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where from the value of the entry in the pointer/next hop field, we can determine if 
it is a next hop or a pointer, and the entry in the offset length field records the 
longest prefix length - S - 1 (S is the bit-length of the partitioned segment) for the 
corresponding segment; 
5 (c) constructing a plurality of Next Hop Arrays based on the offset with 

variable length; 

(d) determining if the NHA needs to be compressed firom the size of each 
NHA; if yes, then constructing a Compression Bit Map (CBM) for the NHA; 

(e) constructing a Compressed Next Hop Array (CNHA) according to each 

10 CBM; 

(f) converting each CNHA into a Code Word Array in which each code word 
comprises a base and a map; 

(g) generating a next hop for the incoming packet according to the following 

looking up steps: 

15 Using the segment of an incoming IP address as an index to look up the 

segmentation table. If the value of the first field (pointer/next hop) of the 
correspondent entry foimd in the segmentation table is less than a predetermined 
number, it indicates that the entry is a next hop. Otherwise, if the value of the first 
field of the correspondent entry foimd in the segmentation table is larger or equal to 

20 the predetermined number and the offset length in the correspondent entry plus 1 is 
smaller or equal to a threshold value, then it indicates that the entry is a pointer. 
So, following the pointer to find out the next hop in the Next Hop Array. And 
when the offset length in the correspondent entry plus 1 is larger than a threshold 
value, then it is a pointer pointing to a Code Word Array. In the following, the 

25 present invention provides a mechanism to get a code word (a base and map) of the 
Code Word Array. And the desired next hop can be found by decoding the code 
word. 

According to the method described above, the invention can compress a 
30 routing table which has about 40K entries into a forwarding table of only 450 to 470 
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Kbytes. The implementation cost can therefore be reduced to the minimum, and 
the speed for looking up the table can also be increased. A preferred embodiment 
of the present invention preferably includes: a segmentation table storage device, a 
Next Hop Array storage device, a Compressed Next Hop Array storage device, and 
5 a Code Word Array storage device. The first 16 bits of the IP address of an 
incoming packet will be used as an index to lookup the segmentation table storage 
device. When the first 20 bits of the correspondent entry is a next hop, then we 
can get the next hop right away. Otherwise, the entry is a pointer pointing to a 
Next Hop Array or Code Word Array. When the value of the last four bits of the 
10 correspond entry plus one is less or equal to 3, using the value of the last four bits 
plus one as an offset to look up the Next Hop Array storage device. On the other 
hand, when the value of the last four bits plus one is larger then 3, then decode the 
Code Word Array that is pointed to by the pointer to look for the next hop stored in 
the CNHA storage device. Thus, in the worst case, a next hop for a route prefix 
15 can be found in three memory accesses. Moreover, the architecture of the 
inventive system can be implemented in a superscalar scalar and pipeline design, 
thereby to increase the speed of finding a next hop. 



jQ BRIEF DESCRIPTION OF THE DRAWINGS 

These and other objects and advantages of the present invention will become 
apparent by reference to the following description and accompanying drawings 
wherein : 

25 

Fig. 1 is a schematic diagram showing the architecture of a conventional 
multi-gigabit switching router. 

Fig. 2 is a block diagram showing the architecture of a Forwarding Engine 
with superscalar and pipeline design. 
30 Fig. 3 is a schematic diagram showing the conventional method of indirect 
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lookup. 

Fig. 4 is a schematic diagram showing the method of indirect lookup with 
variable offset length according to the preferred embodiment of the invention. 

Fig. 5 is a flow chart showing the process for constructing a Next Hop Array 
5 according to the preferred embodiment of the present invention. 

Fig. 6A is a schematic diagram showing an example of the Segment 
presentation of a prefix set P according to the method of the invention. 

Fig. 6B is a schematic diagram showing the Next Hop Array based on the 
example shown in Fig. 6A. 
JO Fig. 7 A is a schematic diagram showing an example for NHA compression 

according to the method of the invention. 

Fig. 7B is a schematic diagram showing an example for a Compression Bit 
Map based on the Next Hop Array of Fig. 6A according to the method of the 
invention. 

15 Fig. 7C is schematic diagram showing a compressed NHA based on the 

Compression Bit Map as shovm in Fig. 7B according to the method of the present 
invention. 

Fig. 8 is a flow chart showing the process for constructing a Compression 
Bit Map and a compressed NHA according to the method of the present invention. 
20 Fig. 9 is a schematic diagram showing an example of the Code Word Array 

according to the method of the present invention. 

Fig. 10 is a block diagram of the hardware architecture showing the 
preferred embodiment of the present invention. 

Fig. 11 is flow chart showing the method for looking up a next hop 
25 according to the architecture as shown in Fig. 10. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
30 To further reduce the size of a NHA, the method of the present invention 
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provides an indirect lookup mechanism with variable offset length as shown in Fig. 
4. In contrast to the constant length of the offset in Fig. 3, the length of the offset 
depends on the longest prefix of an IP address. Thus, the length of the offset will 
be less or equal to 16. An IP address is partitioned into two fields: segment (16 bit) 

5 and offset (< 16 bit). The Segmentation table 42 based on the segment 41 is also 
64K entries. Each entry of the Segmentation Table 42 is 24 bits. Each entry is 
divided into two fields: pointer/next hop (20-bit) field and offset length (4-bit) field. 
The pointer/next hop field records a next hop or a pointer pointing to the associated 
NHA 43. If the pointer/next hop field contains a value less than 256 (8 bits), then 

10 it is a next hop. On the other hand, if the field contains a number larger than 255, 
then it contains a pointer pointing to a memory location indicated by the offset. 
The value of the offset length field (4-bit) + 1 indicates the length of the offset 
which is k bits (0 < A; < 16). If the offset length field is of k-hit length, its 
associated NHA will have 2* entries. Since k is less or equal to 16, so the offset of 

1 5 variable length can further save the memory space for NHA. 

The offset length depends on the prefixes of each segment. For instance, a 
segment of an IP address a.b may have m prefixes. The longest prefix / is larger 
than 1 6 bits but less or equal to 32 bits. The offset length k for this segment will be 
20 (/-1 6) bits. In other words, for each destination IP address a.b.x.y, the segment a.b 
performs as the index for looking up the Segmentation Table 42 and the leftmost k 
bits of x.y (from 16-th bit to (16+k-l)-th bit) as the index for looking up the 
associated NHA. 

25 The construction of the NHA depends on the prefix-set of the segment and the 

length of each prefix in the prefix-set. Refer to Fig. 5 for the steps of constructing 
the NHA of a segment: 

Step 501 : Start and read the set of route prefixes of a segment. 
30 Step 502: Let //and /i, be the length and next hop of a route prefix p;. 
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respectively. Let P= {po,P„ Pm-i) be the set of sorted prefixes of a segment. 
Thus, for any pair of prefixes pi and pj, i < j if and only if /, ^ /, . 

Step 503: After the set P has been sorted, for each prefix p, in P, calculate Sf 
and Ef in the memory address. 
5 Step 504: For each element in the set P, assign the next hop h, of a prefix to 

each corresponding memory address j of the Next Hop Array {NHAJ), where / is 
from 0 to w-1 and ma{ Sf) <j <ma{ Ef ). 

The object of these steps is to rearrange the route prefixes according to their 
10 orders in the set P in the memory. Suppose an IP address has a segment which 
consists of multiple route prefixes representing multiple subnets. Each route 
prefix p, can be assigned with a next hop /i,. Let o, = /, - 16 and k = max{ o, | e 
P } (NHAis of size 2*). 

15 Let P = {po. Pi, Pm-i) the set of sorted prefixes of a segment. Thus, for 

any pair of prefixes /7, and pj, i is less than; if and only if /, is less or equal to /, . 

For each prefix p^ in P, let Sf and Ef denote the data structure of the start 
point and end point of prefix p„ respectively. Moreover, let ma{Sf) and ma{Ef) 
20 be the memory addresses of Sf and Ef in the NHA, respectively. The addresses 
ranging from start address ma{Sf) and end address ma^Ef) of prefix p, should be 
forwarded to next hop /j,. Also let op{Sf) and op{Ef) be the output ports (next 
hops) of the destination addresses of the start point and the end point, respectively. 

25 Assume prefix p, of an IP address be a. b.x.y. Let Xo,x,,X2, ...,x,s represent the 

binary form of x;^, and So,s,,S2, ...,Sk.i the start address mask, where Sj = \,j < o, 
and Sj = 0, j > o,., and eo,e],e2, ...,6^., the end address mask, where = 0,j < 
Oi and ej= l,J > o, . Thus, 

30 ma(Sf) = iXo,x„X2, ...,Xk., AND So,s,,S2. ...,Sk-i \ and 
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ma{ Ef ) = {xo,x„X2, . . . .x^.j OR eg, e,, e^, . . e^.y). 

For example, assume jf, = a.b.58.0, /,• = 26, and A: = 12 (the longest prefix in 
this segment is 28 bits). Then, the binary form of 58.0 (A:-bit) = 001110100000, 
5 5o, 5 „ 5^. — -s*. / = 1 1 1 1 1 1 1 1 1 1 ^0. ey> ^2. — Sfc./ = 0000000000 1 1 . We have 

ma(5P) = 001110100000 = 928 and 
ma{E?) = OOlUOlOOOii = 931. 

1 0 This also means that NHA, = /?„ ma{ S° ) <j <ma( Ef ). 

For each prefix in P, we can find a pair of Sf and Ef . The memory 
addresses between ma{Sf) and ma{Ef) can be depicted as an interval 
ma{ Ef )], and the set P of prefixes can be presented as a set of intervals. 

15 

If none of the intervals is overlapped, then we can construct the NHA 
directly by setting NHA, = h, ma{ Sf )<j <ma{ Ef ). However, this may not always 
be the case in practical application. An overlap of intervals means there are more 
than one matching IP addresses. 

20 

Since the invention adopts the longest matching for the IP addresses, 
therefore if a memory address j belongs to a set P ' of intervals simultaneously, then 
we should set NHA^ = h^, where is the longest prefix of P '. For example, assume 
each route prefix is presented in a format like: prefix/prefix length/next hop (output 
25 port). Then the set P of six sorted prefixes {192.168/16/1, 192.168.58/18/2, 
192.168.92/24/1, 192.168.58.32/26/3, 192.168.255.240/28/5, 192.168.58.36/32/8} 
can be presented as the six segments shovra in Fig. 6A. 

Refer to Fig. 6A, the longest prefix 192.168.58.36 is a complete IP address, so 
30 it represents a point instead of an interval. It indicates that an IP address 
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corresponding to "192.168.58.36" will be forwarded to next hop 8. On the other 
hand, the prefix "192.168" represents a shortest prefix. It indicates any IP address 
in the subnet of 1 92. 1 68 will fall within this range. If we project all the prefixes on 
the prefix Pq, we can find prefixes containing the same subnets will have overlapped 
parts. However, we can find the desired next hop with the longest prefix matching. 
The memory addresses for these prefixes are as follows: 



ma(Sl)- 

ma(Sl)-- 
ma(Sl)-- 
ma(Sl)-- 
ma(S°)-- 



-0; 
■0; 

-23552; 
14848; 
-65520; 
-- 14884; 



ma(El)-- 
ma(Ef)- 
ma(El)-- 
ma( El) -- 
ma(El)-- 
ma(El)- 



--65535 
16383 
--23807 
-14911 
-65535 
14884. 



Their corresponding Next Hop Array will be like the array as illustrated in Fig. 
6B. The Next Hop Array 61 contains memory locations from 0 to 65535. Each 
location stores an output port for an IP address in the sorted prefixes. If the 
prefixes have overlapped portions, the NHA stores the output port for the longest 
prefix. Accordingly, the memory addresses from 0 to 14847 represent output ports 
for P„ 14848-14883 for P3, 14884 for P5, 14885-14911 for P3, and so on. Since the 
output ports for the IP addresses of the same prefixes will be the same, so the Next 
Hop Array as illustrated in Fig. 6B or 7A contains many duplicated port numbers. 
In the worst case, the number of memory accesses for an IP address lookup is still 
two for this NHA construction, but the total amount of memory required can be 
significantly reduced because its offset length is variable. 

The size of the forwarding database structure, that is NHAs, can be fiirther 
reduced by compression. For each segment with offset length ^ > 3, the associated 
NHA can be replaced by a Code Word Array (hereinafter referred to as CWA) and a 
compressed NHA (hereinafter referred to as CNHA). To construct the CWA, the 
technique of Compression Bit Map (hereinafter referred to as CBM) is employed. 
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one bit for each entry in the original NHA. The compression rule is as follows: 

Let Qi denote the value (port number) of the /-th entry of the NHA, 6, stand for 
the corresponding bit in the CBM, and denote the value (port number) of they-th 

5 entry of the CNHA. Initially, = a^, 6^ = 1 . and y = 1 . Then scan the NHA from 
left to right. If a,,; = a, , then = 0, else 6,^, = 1, Cj = a,,y, and j = 
Following this process, every first occurrence of a port number of a prefix in the 
NHA will be marked as "1" in the CBM (CBM). For example, the first occurrence 
of "2", "8", "7", "6", "2" in the NHA as shovm in Fig. 7 A can be marked by a 

10 CBM as shown in Fig. 7B. The rest are marked as "0" in the CBM, 
Consequently, combining the NHA as shown in Fig. 7A and CBMA as shovm in Fig. 
7B can get the Compressed NHA (CNHA) as shown in Fig. 7C. The CBM 
illustrated in Fig. 7B later on can be used to decode the CNHA as illustrated in Fig. 
7C. 

15 

In addition to the method stated above for constructing the CBM and CNHA, 
the present invention also provides another method which can construct the CBM 
and CNHA of a segment directly without constructing the NHA first. The method 
is illustrated in Fig. 8. 

20 

Step 801: Start and read the set P of route prefixes of a segment ' P={Po. 
Pi. Pm-J. where each element in the set is sorted in an increasing order by the 
length of prefixes. Each pair of start point and end point of each route prefix is 
sorted according to their order in set P. The sorted list will be L = 

25 {Sq^Eq ^Sy 5 iS",, , }• 

Step 802: Sort elements in the list Z in an increasing order according to their 
memory addresses in the segment. If two elements have the same memory address, 
then refer to their sequential orders as in the list L, 

30 
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Step 803: initialize an array A = (p and stack C= (p. 

Step 804: Process the elements in the list L from left to right and for each 
element executes the following steps 8041 to 8046: 

Step 8041: Check if the selected element is a start point 5f? "i" represents 
5 the route prefix. If yes, go to step 8042. If not, go to step 8043. 

Step 8042: Push Sf onto stack C. Append 5° to array A. Step 8041 to step 
8046 are finished. Repeat step 8041 to step 8046 until each element has been 
processed. 

Step 8043: Remove the top element fi-om stack C. 
10 Step 8044: Check if the top element of stack C is 5* ? "S* " means that the 

start point of the j-th route prefix in the set has been updated k times in the memory. If 
yes, go to step 8045. If not, go to step 8046. 

Step 8045: Append S)*' to A, where op(S)*') = op(S]), ma(S)*') = 
ma(Ef)+\. And Replace the top element of stack C with 5*+'. Step 8041 to step 
15 8046 are finished. Repeat step 8041 to 8046 until all the elements have been 
processed. 

Step 8046: Do nothing. Step 8041 to step 8046 are finished. Repeat step 
8041 to step 8046 until all the elements have been processed. 

Step 805: Compact the array A such that for consecutive elements 5) and 

20 SI, removes* fi-om array A if ma(S)) = ma(Sl), remove SI from array A if 
op(S)) = op(Sl). 

Step 806: Remove each element S] from array A where ma{S)) > 
ma{ES). 

Step 807: For each start point in the array A, assign "1" to the corresponding 
25 bit of the Compression Bit Map, and assign its output port to the corresponding 
entry of the Compressed Next Hop Array. 
Step 808: Stop. 

The time complexity of the proposed CBM and CNHA constructing method 
30 is 0{nlogn), where n is the number of prefixes in a segment. Since this algorithm 
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constructs the CBMs and CNHAs directly from the given prefixes, the forwarding 
table can be built in a very short time. 

After obtaining the CNHA, the CBM can not directly be used to decode the 
5 CNHA for looking up the output port for each IP address. The CBM should be 
encoded as a sequence of code words (hereinafter referred to as CWA). The length 
of the code words depends on application. According to the preferred embodiment 
of the present invention, a code word of 32-bit is used. However, a code word of 
16-bit or any suitable numbers may also be used. 

10 

Refer to Fig. 9, it shows a CWA 96 followed by a CNHA 95. The CBM is 
treated as a bit-stream and partitioned into a sequence of 16-bit maps. These maps 
are sequentially put into the code words 98, 99, one for each code word. The CWA 
96 consists of multiple code words. Each code word consists of a map (16-bit) 91 

15 and a base (16-bit) 92. Refer to Fig. 9, the base of each code word is equal to the 
number of "T's accumulated in the maps of previous code words. It indicates the 
number of occurrences for different port nimibers up to the code word just foimd. 
For example, the CBM shovm in Fig. 7B is encoded as the Code Word Array 96 
depicted in Fig. 9. The maps 91, 93 of the first two code words 98, 99 are 

20 "1000000010000000" and "0000000010001000", respectively. For the first code 
word 98, the base 92 has a value of zero because it is the first code word. For the 
second code word 99, the base 94 has a value of "2" because its previous map 91 
has two "l"s. 

25 Accordingly, the base of each code word is used to indicate the start entry of 

the associated CNHA. For an offset value q, the output port can be computed as 
follows: Let cw,= map, + base, be the code word containing this offset, where s = {q 
DIV 16). Let w = {q MOD 16) denote the corresponding bit of q in map, and 1^1 
represent the number of accumulated "l"s from the 0-th bit to the w-th bit of map, 

30 Then, the output port of an offset value q can be calculated as 
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op^ = CHNA,, where t = base, + . 

Take the examples shown in Figs 7B, 7C and Fig. 9 again for illustration. • For 
5 offset ^ = 8, we have s = 0, w = 8, and \w\ = 2, then t = ibaseo+\w\-Y) = 0+2-1 = 1 and 
the corresponding output port is the second entry in CNHA, which is port 8. For 
offset value q = 25, we have 5 = 1, w = 9, and lw| = 1, then t = {base ,+\w\-\') = 2+1-1 
= 2. The corresponding output port is the third entry in CNHA2 which is port 7. 

10 To update the forwarding table, we can either rebuild a new one in a short time 

or through special hardware design, such as dual-port memory or dual-memory 
banks. 

The high-level hardware implementation according to the preferred 
1 5 embodiment of the present invention is shown in Fig. 1 0. Refer to Fig. 1 0, the first 
16 bits (bits 0-15) of an incoming IP address 101 are used as an index to look up the 
Segment Table 102 which is 64K. Each entry of the Segmentation Table storage 
device 102 has a length of 24 bits. The corresponding entry of the Segment Table 
storage device 102 records either the next hop (the value of left 20 bits < 256) of 
20 this destination IP address, or a pointer (20-bit) pointing to the starting address of 
the CWA storage device 104 or the NHA storage device 105, and an offset length of 
the actual offset length minus 1 (k-1, 4-bit). For each segment, if the offset length 
A: > 3, then we need to decode the CNHA storage device 108 by searching the CWA 
storage device 104 (with 2*"^ entries) and finding the associated code word. To 
25 decode the code word, using the 16* bit to the k+15* bit of the destination IP 
address as an index to look up the code word in the Code Word Array. From the 
code word found, the map and base can indicate the location of the port number in 
the CNHA storage device 108. 

30 Since the CNHA storage device 108 is located immediately after the CWA 



15 



storage device 104, the starting address of CNHA storage device 108 is equal to 
( the pointer + 2*"^x4 - 1 ) .An adder 107 is designed to add this v/ith base and |w|. 

If A: < 3 (the offset length less than 4 bits), then the k bits, starting from the 16- 
5 th bit of the destination IP address 101, are used as the index of the NHA storage 
device 105 (with 2* entries) to find the output port. 

The value of |vi;| can be computed by a parallel adder 106. For example, assume 
for a segment with offset length A: is 8 and a destination IP address is ab.l77,y with 
10 offset of 177. Then we should search the s-th code word, where s is equal to 11 
(177 DIV 16 ). Assume the map of this code word is IQOO 1000 11 000 100, then the 
bit position for this offset will be 1 (177 MOD 16). 

Let Bf denote the bit stream from the J-th bit to /-th bit of an IP address 101 
15 and V{B/) stand for the value of bit stream Bf\ To compute the value of we 
can first mask the right l6-V(B^t\5)'^ bits of the code word into zero and then 
calculate the number of "T's in this masked code word by the parallel adder 106 in 
constant time. 

20 According to the architecture as shown in Fig. 10, the next hop can be found by 

following the steps as shown in Fig. 1 1 . Refer to Fig. 1 1 , 

Step 1101: Start. Here an IP address of 32 bits is used as an example for 
illustration. 

25 Step 1102: Let B/ represent the jth to ith bits of an IP address. V{B/) be 

the value of B/ . Use F(5° ) as an index to look up the entry in the segmentation 
table. 

Step 1103: Determine if the leftmost 20 bits of the corresponding entry is 
larger than 255? If yes, go to step 1 105. If not, go to step 1 104. 
30 Step 1104: Since the entry is an output port, so get the next hop value 
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directly from the value of the leftmost 20 bits of the corresponding entry. And go to 
step 1111. 

Step 1105: Determine if the value of the rightmost 4 bits of the 
corresponding entry is larger than the value of 3 bits? If yes, go to step 1107. If 
5 not, go to step 1106. 

Step 1106: Use V(Bll^^) as an index to look up the Next Hop Array. And 

go to step 1111. 

Step 1107: The coresponding entry is a pointer. So, use the pointer + 
as an index to find the corresponding code word from the Code Word 
10 Array of the corresponding segment. 

Step 1 108: Input two data Map and Mask m bits into the parallel adder to get 
the value of \w\ where \w\ means the number of "l"s accumulated from the 0-th bit to 
the W'th bit. 

Step 1109: Compute the index for looking up the CNHA by adding (pointer 
15 + 2* '^x4 - 1), Base and 

Step 1110: Lookup the next hop value from the CNHA according to the 
index computed in step 1109. 

Step 1111: output the next hop. 

20 For current ASIC technology, the parallel adder 106 in Fig. 10 takes no more 

than 8 ns. In Fig. 10, the length of the code word is 32-bit and is identical to that 
of the data bus. In other words, it only needs one memory access to obtain a code 
word in the CWA storage device 104. For a destination IP address a.b.x.y, the 
segment a.b (ViB^s)) is first used as the index to look up the Segmentation Table 

25 storage device 102. If the offset length k of this segment is less than or equal to 3, 
then we can obtain the output port from the NHA 1 05 directly. In this case, it takes 
two memory accesses. If the offset length k is greater than 3, then we use 
pointer+V(Bll^^) as the index to lookup the CWA storage device 104 to get the 
desired code word. Based on the obtained map and base of the code word, we can 

30 compute the address of the output port in the CNHA storage device 108, In this 
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case, it takes three memory accesses to get the output port. 

To sum up, the lookup scheme according to the preferred embodiment of the 
invention also provide a forwarding table having the size ranging from 450Kbytes to 
5 470 Kbytes. Moreover, most of the lookup can be done by only one memory 
access. In the worst case, the number of memory accesses required for a lookup is 
three. When implemented in a pipeline skill in hardware, the preferred 
embodiment of the invention as shown in Fig. 10 can achieve one routing lookup for 
each memory access. This small forwarding table is very suitable to be 

10 implemented in faster SRAM. With current 10ns SRAM, this mechanism 
fumishes approximately 100x10^ routing lookups per second. The implementation 
cost is also very low. With current 10ns SRAM, the present invention fumishes 
approximately 100x10^ routing lookups per second. This speed is much faster than 
any routing lookup schemes available on the market. Moreover, based on the 

15 proposed algorithm, the CBM and CNHA of a segment can be constructed in 
O(nlogn) time; where n is the number of prefixes in the segment. Thus, The 
invention can update the forwarding table in a quick and efficient way. Moreover, 
since the forwarding table can be constructed in a short time, a new forwarding table 
can be rebuilt whenever necessary or be built based on the hardware of dual-port 

20 memory or dual-memory banks. 

While this invention has been described with reference to an illustrative 
embodiment, this description is not intended to be construed in a limiting sense. 
Various modifications and combinations of the illustrative embodiment, as well as 
25 other embodiments of the invention, will be apparent to persons skilled in the art 
upon reference to the description. It is therefore intended that the appended 
claims encompass any such modifications or embodiments. 
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